Fusion of dictionaries in voice creation and speech synthesis task

نویسندگان

  • Tatyana Polyakova
  • Antonio Bonafonte
چکیده

The accurate phonetic transcription is very important for different fields of speech technologies. In speech synthesis, it is the benchmark for the voice segmentation, and therefore one of the crucial points for the synthesized speech pronunciation quality. In ASR the availability of matching phonetic transcription allows a higher recognition precision. Use of different dictionaries could improve the phonetic transcription since it allows a better word coverage, but the “direct” dictionary merging presents incompatibility problems. Dictionary fusion is the method that automatically learns the dictionary-to-dictionary transformation rules. The results presented in this paper show that fusion significantly improves the compatibility between the dictionaries (about 32-83% of improvement), and allows reducing the number of “severe” pronunciation errors in comparison with the graphemeto-phoneme (g2p) techniques.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Voice-based Age and Gender Recognition using Training Generative Sparse Model

Abstract: Gender recognition and age detection are important problems in telephone speech processing to investigate the identity of an individual using voice characteristics. In this paper a new gender and age recognition system is introduced based on generative incoherent models learned using sparse non-negative matrix factorization and atom correction post-processing method. Similar to genera...

متن کامل

Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques

One of the interesting topics on multimedia domain is concerned with empowering computer in order to speech production. Speech synthesis is granting human abilities to the computer for speech production. Data-based approach and process-based approach are the two main approaches on speech synthesis. Each approach has its varied challenges. Unit-selection speech synthesis and statistical parametr...

متن کامل

Towards Establishing a Methodology for Benchmarking Speech Synthesis for Computer-Assisted Language Learning (CALL)

1. Introduction In very simple terms, speech synthesis is the process of making the computer talk. As such, speech synthesisers offer another means of providing spoken language input to the learner in CALL environments. Indeed, many potential benefits (ease of creation and editing of speech examples, generation of various kinds of modified input, generation of speech models and feedback on dema...

متن کامل

A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain

Quality of speech signal significantly reduces in the presence of environmental noise signals and leads to the imperfect performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, the single channel speech enhancement of the corrupted signals by the additive noise signals is considered. A dictionary-based algorithm is proposed to train the speech...

متن کامل

The Relationship Between Acoustic Characteristics and Personality Dimensions in Patients With Dysphonia

Objectives: Voice is influenced by personality. However, it is still questionable which acoustic features are influenced by personality traits. This study aimed to investigate the relationship between acoustic characteristics and personality dimensions. Methods: Thirty-three participants with dysphonia and 33 participants without dysphonia were recruited to take part in this cross-sectional st...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007